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1.  INTRODUCTION 

In  this  paper,  we  consider  the  Maximum  a-Posteriori  estimation  of  random 
fields  observed  via  nonlinear  sensors  in  multidimensional  white  noise.  Our  goal  is 
to  extend  the  one  dimensional  results  obtained  for  diffusion  processes  in  [1 1].  Due 
to  singularities  which  appear  in  the  case  of  non-Gaussian  random  fields,  we  defer 
the  treatment  of  those  to  a  companion  paper  ([2]). 

The  basic  model  we  will  deal  with  is  that  of  random  fields  generated  by  the 
solution  of  noise  driven  elliptic  PDE's:  loosely  speaking,  let  D  be  a  nice  bounded 
domain  in  Rd  and  let  P  be  a  strongly  elliptic  operator  of  order  2k,  with  Pg  an 
associated  boundary  operator,  the  field  model  we  consider  is 

j  Pu(x)  =  n(x)  xeD 

|P3u(x)=0  xe5D 

where  n  is  white  noise.  For  simplicity,  we  will  concentrate  on  the  Dirichlet 
problem. 

The  observation  model  will  be  that  of  white  noise  corrupted  nonlinear 
observations,  i.e. 

y  =  h(u)+n  (1.2) 

(an  exact  definition  of  the  model  involved  is  given  in  section  2  below). 

A  typical  example  of  the  model  is  the  following  problem  which  we  consider 
as  our  prototype  example.  It  motivates  our  study  since  it  seems  suitable  for  image 
analysis  applications. 

(P)  Au  -  a2u  =  n  xeD  =  [0,1]2 


u|  =0 
dD 

X1  x2 

y(xrx2)  =||  u(ei5  62)deid02  +  w(xrx2) 

0  0 

where  w(xi,x2)  is  a  Brownian  sheet  on  D  independent  of  n.  Note  that  in  (P),  k=l 
and  d=2. 
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Following  the  approach  of  [1 1],  we  define  the  "posterior  probability"  of  the 
process  u  given  the  observation  a-field  a{y}  as 

,  «,)=lim  P("u-4,||<e|eT{y})  (13) 

y  e->0  P(llull  <  e) 

where  II  II  denotes  the  sup  norm.  We  refer  to  Jy(<|>)  as  the  "conditional  Onsager- 
Machlup"  functional.  We  will  show  in  Section  3  that  for  suitable  <(>,  J(<|))  is  well 
defined,  at  least  for  almost  all  y  in  the  support  of  the  measure  generated  by  (1.2). 
Note  however  that  unlike  in  [1 1],  the  normalizing  constant  P(llull  <  e)  depends  on 
the  model  (1.1)  used,  although  not  on  <j>.  Actually,  in  the  problem  (P)  described 
above,  had  we  tried  to  define  Jy(<|>)  using  for  example  P(IIA*1nll  <  e)  as  the 
normalizing  constant,  the  limit  in  (1.3)  would  have  in  general  failed  to  exist,  even 
for  the  case  h(-)  =  0.  This  is  the  main  new  difficulty  in  the  d>l  case,  and  this  fact 
forces  us  to  treat  the  non-Gaussian  case  separately,  for  in  that  case  no  <|>- 
independent  normalizing  constant  can  be  found. 

Once  (1.3)  is  well  defined,  an  obvious  candidate  for  a  "sample"  MAP 
estimator  is 

$=argmax  Jy((J>)  (1.4) 

From  this  point  on,  the  treatment  is  similar  to  the  1-D  case:  the  existence  of 
estimators  is  proved  and  a  representation  result  derived.  In  the  case  of  linear  h(-),  a 
convexity  argument  yields  also  the  uniqueness  of  the  estimator.  When  specified  to 
our  prototype  problem  (P),  the  results  read: 

A2$  =  -h‘($)h($)  +  h'($)  y  in  D 

/  $  =  0  on  9D  (1.5) 

=  0  on  9D 
Idv 


and  the  term  h'(<|>)y  is  to  be  understood  in  the  Ogawa  sense,  c.f.  definition  3.1 
below. 
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Finally,  we  remark  that  we  have  not  tried  to  treat  the  most  general  case 
possible:  thus  we  consider  only  bounded  domains,  we  do  not  consider  pseudo¬ 
differential  operators  (which  for  d  such  that  is  not  an  integer  are  natural 

candidates,  since  A(d+1V4  creates  in  Rd  the  Levy  motion)  etc.  The  results 

concerning  existence  of  the  limit  J(<j>)  and  of  solutions  $  do  however  seem  to  carry 
over. 

The  organization  of  the  paper  is  as  follows:  in  section  2  below  we  define 
rigorously  our  model  as  well  as  the  notations  used.  In  section  3  we  define  the 
conditional  Onsager-Machlup  functional  and  prove  it's  existence.  Finally,  in 
section  4  the  estimation  problem  is  finally  attacked:  existence  and  representation 
results  are  derived. 
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2.  Model  Definition 

Let  D  be  a  closed  bounded  domain  in  Rd>  with  a  smooth  boundary  3D.  Let 
P  be  a  strongly  elliptic  differential  operator  of  order  2k  with  smooth  coefficients, 
and  let  Pg  be  the  boundary  operator  (of  order  k-1).  We  denote  by  B  the  Dirichlet 
form  associated  with  (P,Pa)  (c.f.  [1]). 

In  the  sequel,  Wm>2(D)  will  denote  the  usual  Sobolev  space  of  order  (m,2) 

based  on  D.  The  Sobolev  norm  in  Wm’2(D)  is  denoted  by  II  l!m,2  and  II  II2  denotes 

in  2 

the  usual  L2(D)  norm.  Wq  ’  (D)  will  denote  the  closure  of  C^(D)  w.r.t.  the  norm 

II  Hm,2-  WQm’^(D)  is  the  space  of  distributions  which  is  the  dual  of  W™’^(D).  We 

denote  by  n  the  random  distribution  valued  white  noise  in  D,  i.e.  the  random 
distribution  n  such  that  for  each  smooth  <J>e  C”(D),  n(<j>)  is  a  Normal  random 

variable  of  mean  zero  and  variance  Ik^.  Note  that  n  is  W'm’2(D)  valued  for 

m>d/2  [9]  and  that,  for  <|)e  (D)  and  any  basis  ei  of  L2(D), 

n(<M  =  S  a-O^e.)  (in  q.m.)  (2.1) 

1-1 

where  the  ai  in  the  R.H.S.  in  (2.1)  are  i.i.d.  N(0,1)  random  variables,  with  ai  = 
n(ei). 

By  a  solution  to  the  equation 

{Pu  =  n  inD 

(2.2) 

Pau  =  0  on  3D 

we  mean  a  distribution  valued  random  variable  u  such  that  u  has  a  continuous 
version  and  B[<|),u]  =  n(<)>)  for  all  (])  e  C~(D),  where  B  is  the  associated  Dirichlet 

form  [9].  Note  that  u  is  not  a  classical  solution  and  moreover  even  the  boundary 
conditions  need  not  be  classically  satisfied.  Only  if  u  e  ’^(D)  will  one  have  a 

classical  generalized  Dirichlet  problem  in  the  sense  of  [1,  ch.8]. 

We  can  show  the  following  basic  theorem: 


-6- 


(i  2 

Theorem  2.1:  Assume  that  2k  >  Further  assume  that  IB[<|),(|)]I  >  Clkj)^  2-  Then 

(2.2)  has  a  unique  L2(D)  solution  which  is  Holder  continuous  with  some  exponent 
>  0.  Moreover,  any  two  such  solutions  are  equal  in  the  sense  of  distributions  in 
WQk’2(D). 

Proof.  The  theorem  follows  by  an  easy  application  of  the  machinery  developed  in 

[1],  [9].  We  therefore  give  below  only  it's  sketch. 

k.2 

Uniqueness.  Let  (j)  e  WQ  and  let  P*\|/  =  <)>,  \j/  =  0  on  5D.  By  the  usual  theory 

2k  2 

PDE's,  \]/€  Wq  ’  (D).  LetUi,  U2  be  solutions  of  (2.2).  Then 


0  =  n(¥)  -  n(\)/)  =  Ux(P*v)  -  U2(P*y)  =  U^)  -  U2(<» 

k  2  -k  2 

Since  ())  is  arbitrary  in  Wq  ’  (D),  one  deduces  that  Ui  =  U2  in  Wq  ’  (D). 

Existence.  We  show  the  existence  only  for  the  case  of  formally  self  adjoint 
operators,  the  general  case  requiring  a  different  but  similar  construction.  We  first 
quote  the  following  lemma,  which  is  a  combination  of  theorems  (16.5)  and  (15.1) 
in  [1]. 

k 

Lemma  2.1.  Let  {e^ ,  Ai}  denote  the  generalized  eigenfunctions  and  the 
eigenvectors  associated  with  B,  respectively,  i.e. 

(P  -  U)ke*  =  0 ,  ie  J,  k  =  l,2,...,k(i)  (2.3) 

Then 

(a)  J  is  countable  and  k(i)  is  finite  for  all  i. 

(b)  Let  N(A)  be  the  number  of  eigenvalues  (counting  multiplicty),  such 
that  Re(X-i)  <  X.  Then 

N(^)  =  cXmk  +  0(Xdl2k) 

(Note  that  (b)  implies  that  when  arranged  by  increasing  size.  A*  ~  i2k^d). 

(c)  ek,  ieZ+,  k=l,2,...,k(i)  span  L2(D). 


(2.4) 
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Note  that  the  assumptions  of  the  theorem  also  guarantee  that  lA-il  >  c.  Let 
now  aj;k  be  the  i.i.d  N(0,1)  r.v.  given  by  a^k  =  n(e^ ).  Let  Ai  be  the  k(i) 

dimensional  matrix. 

k.  1  0 
1 

•  1  . 

0  'k. 

1J 


and  bijc  the  k-th  element  of  the  vector 


(A;) 


'a..  ' 

i.i 


lai>k(i)j 


We  claim  that 

»-X 

ieZ+  k=l 

Note  that  E(llull|)  =  E(b?k)  <  C  <  <*>,  and  therefore  ueL2(D).  It 

i.k  i 

is  easy  to  check  that  u  does  indeed  satisfy  (2.2),  and  the  existence  of  a  continuous 
version  follows  from  Kolmogorov's  criteria.  We  ommit  the  details. 

We  conclude  this  section  by  defining  the  observation  model:  let  h(  ):R  — » R 
be  a  C2k+1  function  with  all  derivatives  up  to  order  2k+l  bounded.  Let  n  be  a  d- 
dimensional  white  noise,  independent  of  n.  Note  that 

xi  xd 

w(x15...,xd)  =  J  ...  |  n(61,...,ed)d6 1 ...  d0d 
o  o 

is  a  Brownian  sheet.  We  define  the  observation  model  y(x),  xeD  as 


(2.5) 
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xi  xa 

y(20  =  |  •  •  •  |  h(u(01,...,0d))dfi  +  w(x)  (2.6) 

o  o 

Note  that  as  in  [10],  the  measures  generated  by  y(x)  and  by  w(x),  xeD  are 
equivalent  with  Radon-Nikodym  derivative: 

^  =  exp(J h(u(0))dw(0)  h2(u(0))d0)  (2.7) 

w  D  D 

where  the  first  integral  in  (2.4)  is  a  stochastic  integral  which  is  well  defined  since 

u(x)  is  independent  of  o{w(0),  0  €  D}.  We  refer  below  to  (2.7)  as  the  "likelihood  ratio". 
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3.  Prior  Density  -  The  Onsager-Machlup  Functional  For  Gaussian  Random  Fields 
We  recall  that  in  [3],  [8],  the  following  definition  was  introduced  for  the 

2 

prior  density  of  a  diffusion  process  satisfying  dxt  =  f(xt)dt  +  dwt  with  f  a  given 
function: 


I(4>)  =  lim 

e  — >  0 


P(llx  -  <|)||  <  e) 
P(llwlke) 


(3.1) 


where  II  II  denotes  the  sup  norm  (here,  in  [0,1],  say).  Note  that  the  denominator  is 
a  "standard  reference"  and  does  not  depend  on  f. 

For  <|>  e  C1+a,  a>0,  one  can  show  ([13])  that  in  the  1-D  case, 


log  1(0)  =  ■ J  ($  -  f(0))2dt  -  j  Jf ’(<|>)dt  (3.2) 

1  0  0 

For  linear  f,  the  second  term  in  the  RHS  of  (3.2)  is  a  constant.  It's  origin  can  be 
traced  to  the  fact  that  in  the  Radon-Nikodym  derivatives  between  the  two  measures 
in  the  RHS  of  (3.1),  a  term  of  the  form  Jf(<())  wdw  =  Jf  (<]))  — ~  appears;  under 


the  conditioning  of  w  small,  the  dw2  term  drops  and  one  recovers  (3.2). 

A  natural  approach  to  the  multidimensional  case  would  try  to  compute 
(3.1).  However,  here  instead  of  J  f  (<|>)wdw,  one  would  have  a  term  of  the  form 
Jf '  ((())P-1(n)n.  Proceeding  formally,  it  is  easy  to  check  that  this  term  is  infinite, 
even  under  the  conditioning  of  llwll  small.  We  postpone  the  treatment  of  the  case  of 
nonlinear  drifts  to  [2].  For  Gaussian  fields  u  generated  by  P,  however,  one  has: 
Theorem  3.1: 


Leae^APfiMfce) 

P(llulke) 


■.  Then 


a) 


J(0)  =  lim 
£->0 


Je(<J>)  exists  for  4>  e  W 


4k,2 

(Q  which  satisfy  the  boundary  conditions 


p,*-a 
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b)  J(<j>)=exp-i-J(P<»)2 

D 

Proof.  Let  A  denote  the  Radon-Nikodym  derivative  between  the  measure  induced 
by  u  and  that  induced  by  u-<|).  One  then  has  [7]: 


Note  now  that 


(3.3) 


Je(<|>)  =  E0(aI|IuII  <  e) 

where  Eo  denotes  expectations  w.r.t.  the  measure  defined  by  <|)  =  0.  Therefore,  to 
prove  the  theorem  we  have  to  show  that  for  <|)  e  w4k>2(D), 

E0(exp  J-  (P<J))nl  llulke)e4°  1  (3.4) 

D 

Note  that  since  P<j>  e  W21^2  (D)  L2(D),  and  is  deterministic  (and  in  general, 

independent  of  o(n))  the  stochastic  integral  in  (3.4)  is  well  defined.  Moreover, 

J  P<|)  n  =  J  P<|>Pu 

D  D 

=  J  P*(P(j))u  (3.5) 

D 

However,  \jr  =  P*P<j)  e  L2(D)  is  independent  of  u.  One  has  therefore  I^j(P*P<|))u  I 

<  kliullli\|/ll2  from  which  (3.4)  follows. 

As  was  the  case  in  [1 1],  theorem  (3.1)  will  not  be  quite  enough,  for  the 
optimal  estimate  will  turn  out  not  to  be  in  W4k>2(D).  In  order  to  define  the  class  of 
functions  in  which  the  estimator  exists,  we  need  to  introduce  some  new  machinery, 
which  is  reminiscent  the  "Ogawa  integration"  in  1-D  (c.f.  [4],  [6]): 

Definition  3.1.  Let  <|>  e  L2(D)  be  random  and  generated  by  the  white  noise 
n.  Let  ei  be  an  arbitrary  orthonormal  base  in  L2(D)  which  satisfies  the  boundary 
conditions  of  (2.2).  Recall  that  n  can  be  represented  as  n=  £ajei  in  the  sense  of 

l 
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distributions  in  W"d’2(D)  where  ai  are  i.i.d.  N(0,1)  r.v.,  and  ai  =  ^ejn  is  well 

defined  since  ei  is  deterministic. 

If  the  sum 


(3.6) 


converges  in  L2  (Q)  and  its  value  does  not  depend  on  the  choice  of  the  basis  e;,  we 
say  that  <j)  is  Osawa  inteerable  and  denote  its  integral  I  =  ^|(J)on. 


It  is  easy  to  see  that  if  <|>  e  Wq’^(D),  (j)  is  Ogawa  integrable;  actually,  we 


can  have  more: 

Lemma  3.1.  Let  u  e  L2(D)  be  a  deterministic  bounded  function.  Let  K  e 

2k  2 

Wq  ’  (D)  be  a  random  function  generated  by  ff ,  with  2k  >  d/2.  Then 


J  (uKon)  =  J  P*(K)P_1(u n)  (3.7) 

D  D 

where  P_1(uff)  denotes  the  unique  L2(DxQ)  solution  to  the  S.P.D.E. 


J  Pv  =  un  (3.8) 

W  =  ° 

which  exists,  and  the  equality  (3.7)  is  in  the  sense  that  each  side  exists. 

Proof.  Assume  first  that  a  unique  solution  to  (3.8)  exists  in  L2(DxQ).  By  our 
assumptions,  the  R.H.S.  of  (3.7)  exists  as  a  r.v.  in  L2(Q). 

Let  now  ei  denote  an  L2(D)  basis  which  satisfies  the  boundary  conditions 
associated  with  P.  One  has  that: 

XffuKeij  fj  ei°"|  =  Z([P*<K> pl  K'jfJ  'i0" 
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M 

r  p*(k)  ^ 

P_1(ue.)  f(e.on) 

D  i=1 

w 

D 

(3.9) 


By  our  assumption  on  the  existence  of  a  unique  L2(Dx£2)  solution  to  (3.8),  one 
checks  easily  that  the  sum  in  the  R.H.S.  of  (3.9)  converges  in  L2(DxQ)  to  such  a 
solution,  and  (3.7)  is  proved. 

It  remains  therefore  to  show  that  a  unique  L2(QxD)  solution  to  (3.8)  exists. 
Note  that  if  a  solution  which  satisfies  the  boundary  conditions  exists  then  it  is 
unique,  since  if  vi  and  V2  are  two  such  solutions  P(vi-V2)  =  0  and  vi-V2  =  0  by  the 
classical  PDE  theory.  We  therefore  proceed  to  construct  a  solution: 

Let  (Xi,  \|/j)  be  the  eigenvalues  and  the  generalized  eigenfunctions  associated 
with  P.  For  simplicity,  we  assume  that  the  eigenvalues  have  all  single  geometric 
multiplicity  -  the  general  case  follows  easily  since  by  [1]  the  eigenvalues  have 
always  finite  multiplicity.  Note  that  since  u  is  bounded,  u\j/k  e  L2(D)  and  one  has 


uYk(x)  =  2  bk  V*  (x) 

i 

where  b£  =  ^u\|/k\|/£.  Let  ak=  ^j\|/kon.  Define 


c<= 

M  k  ) 


Note  that 


i  i  Xf  k 


However,  b£  =  ^ju\|/kV|/£  therefore 
XK)  =fu2V?W£||ull2<c 
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Therefore,  E^jjj  <  c^^  —  <  <*>  for  2k  >  d/2.  Define  now 
i  i  X] 


v  =  2i,C[\\f[(x) 
c 

Clearly,  v  e  L2(D).  It  is  easy  to  check  that  v  such  defined  is  a  solution  to  (3.8). 
The  proof  is  completed. 

We  are  ready  now  to  state  the  extension  of  Theorem  (3.1)  which  we  will 
need  in  the  sequel. 

Theorem  3.2. 

Let  <j)  e  W2k’2(D)  be  random  and  generated  by  some  white  noise  n 
independent  of  n.  Assume  that  <()  satisfies  the  boundary  conditions  associated  with 
P.  Let  \j/  =  PcJ)  e  L2(D)  and  assume  that 


P*\jr  =  A(<|>,\|/)  +  K(<)>)oh  a.s. 


(3.10) 


,2k. 


where  A(<j),\j/)  e  L2(D)  and  K(-)  e  (R  —>  R),  where  by 
that  for  any  g  e  C°°(D),  ^|\j/Pg  =  ^|gK((|))on  +  ^A((J>,\}t)g. 
of  theorem  (3.1)  still  holds  a.s.  in  n. 


a  solution  we  mean 
Then  the  conclusion 


Proof.  We  have  only  to  show  that 


EQ(exp  J \j/  onlllull  <  e,n)£->  1  a.s.  n 

D 

Note  that  by  lemma  (3.1)  (with  u=l,  K  =  (P*)TA(<t>,\j/)),  the  A(4>,xj/)  term  in  (3.10) 
does  not  cause  any  difficulty.  Therefore,  we  actually  need  to  show  that 

|i£  =  EQ(exp  J  (j/  onlllull<8,n)£:40l  a.s.  n 

D 


where 
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P*\j/  =  K(()))on 


Indeed, \jr  =  \j /  -  (P*)~1A(<|>,\)/).  Note  that 


jj.e  >  exp  Eo(^jtjf  onlllull  <  e,ff)  =  1 


(3.11) 


due  to  the  symmetry  in  n  on  the  conditioning  set  Hull  <  e.  On  the  other  hand,  let  C[ 
be  the  eigenfunction  expansion  of  P,  which  spans  L2(D),  and  assume  for  simplicity 
that  all  eigenvalues  are  simple  (the  general  case  does  not  pose  any  difficulty).  Then 

f  \jr  on  =  ^  (\j/,  e.)(e.,n)  =  ^  (P*\j/,  P'1e.)(P*ei,  P'*n) 

D  i=1  i=1 

=  ]£  (P*V,  e.)(e.,u)  =  ]£  (P *v,  (e.,u)e.)  =  fuK(<}))on 

i=l  i=l  D 

(3.12) 


where  we  have  used  the  fact  that  ei  g  (D)  and  u  is  bounded.  Applying  lemma 
3.1,  we  have  therefore  that 

J  \jr  on  =  J  P*(K(<|>))  P"  1(un) 

D  D 

Therefore, 

lJ\jronl<C||P'1(un)ll2  (3.13) 

D 

Note  that  0  =  P_1(un)  is  conditioned  on  u  a  Gaussian  process  with 
2 

EOlSiyu)  <  Cllull2  converging  to  zero  as  Hull  -4  0  (see  lemma  3.1).  Therefore, 


for  any  c>0 
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En(jiclllull  <  e) 
u  £ 

<  EQ(exp  cj  \\f  onllluli  <  e) 

D 

<  EQ(exp  c1ll6||2Illull  <  e) 

<  E0(EQ(exp  Cj  1101  l2|u)  I  Hull  <  e) 

<  EQ(exp  c2llull2lllull  <  e) 

<  exp  cue2— >  1  (3.14) 

£— >o 

Combining  (3.14)  with  (3.11)  one  obtains  the  theorem. 

In  the  sequel,  we  say  that  <(>  e  L  if  <j>  satisfies  the  conditions  of  theorem 

(3.2). 
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4.  MAP  Estimator 

We  start  by  finally  defining  our  MAP  estimator:  Let  u  denote  the  solution  of 
(2.2)  and  let  y  denote  the  observation  defined  by  (2.6).  Define 

je  (n  a  PflMWqfr}) 
y  *  P(llull<e) 

We  make  the  following  definition: 


Definition  (4.11: 

Let  Jy(<j>)  =  ^lixr^J^ (<[)),  and  assume  a  (not  necessarily  unique)  solution  exists 
to  the  following  stochastic  optimization  problem 


: arg max  J(<J>) 
<}>e  jS 


(4.1) 


Then  <p  is  called  the  MAP  estimator  of  u  given  a{y}. 


Remark.  As  we  will  see  below,  a  version  of  Jy(<J>)  can  be  defined  for  all  y  in  the 
support  of  the  law  of  y  defined  by  (2.6).  Therefore,  the  optimization  problem  of 
definition  (4.1)  is  well  posed.  It  will  also  hold  true  that  the  optimization  of  the 
expression  we  have  for  Jy(<(>)  over  a  space  larger  than  L  (specifically,  over  Wq^(D)) 


still  yields  a  solution  in  L .  It  therefore  justifies  the  fact  that  we  look  for  a  solution 
in  L  to  start  with. 

In  order  to  compute  Jy(<t>),  we  start  with  the  following  lemma,  whose  proof 
we  defer  to  the  appendix: 


Lemma  (4.1) 

HP 

Let  — ^  be  as  in  (2.7).  Then 
dPw 
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2(<K0))d0  (4.2) 

where  the  stochastic  integral  in  the  RHS  of  (4.2)  is  well  defined  for  all  <j>  e 
W2k>2(D)  and  W  in  the  support  set  of  the  measure  defining  y  by  the  pairing  between 
W-d/2-8»2(D),  (5<2k-d/2),  and  W2k>2(D)  (i.e.,  it  is  well  defined  even  for  <)> 
stochastic  which  depends  on  y). 

Combining  now  lemma  (4.1)  and  theorem  (3.2),  one  has  the  following  easy 
corollary. 


Corollary  4.1. 

V<|)  e  L, 


Jy  (0)  i  log  Jy(<J>)  =  [  J  h(<j)(0))dy(0)  -  i  J  h2((K0))d0  -  j  J  (P(^(0))2d0] 
odd 

(4.3) 

Having  defined  the  cost  functional  ty(<j)),  we  can  turn  to  the  existence  issue. 
We  claim: 


Theorem  4.2. 

Assume  that  the  conditions  of  theorem  2.1  hold  together  with  h  e  C2k+1(R) 
2k 

and  h'  e  Cq  (R). 

Then  a  solution  to  the  problem 
$=  arg  max  J  V(<J>) 

exists.  Moreover,  in  the  case  h(x)  is  linear,  this  solution  turns  out  to  be  uniqe. 


Proof.  The  proof  follows  closely  the  lines  of  [12].  We  proceed  in  two  steps: 
a)  We  show  that  for  each  ye  C(D), 


lim  ?y(<j))  =  -oo,  Jy(0)*-oo 
’■2*2 


00 
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b)  We  show  that  ty(<j))  is  lower  semi  continuous  w.r.t.  the  weak 
topology  in  W2k>2(D). 

Note  that  a)  and  b)  imply  the  first  part  of  the  theorem,  for  by  a)  there  exists 

for  each  ye  C(D)  a  number  R(y)<°°  such  that  the  supremum  of  ty((|))  is  achieved 
inside  a  ball  of  radius  R(y)  in  W^^fD).  This  ball  being  weakly  compact,  b)  then 
implies  that  the  supremum  is  actually  achieved.  Note  that  due  to  the  strict  convexity 

in  <])  of  ty(<|>)  when  h(-)  is  linear,  the  second  part  of  the  theorem  follows  once  the 
first  part  is  proved. 

We  turn  now  to  the  proof  of  a):  Note  first  that  ty(0)  =  Jh(0)dy  -  j 

h2(0)Vol(D)  >  -«*>.  Due  to  the  ellipticity  of  P  and  our  assumptions  on  P,  there 
exists  a  ci  >  0  such  that 

J<p«|)))2de a (CjiiA^nl  -  c2ii«22)v<^ 

D 

On  the  other  hand, 

I  JhWdyl  £  llyll^  M  IlhfWII^  2  <  c4(y)(ll<t>ll2  +  IIAk$ll2) 

D 

with  C4(y)  <  where  8  <  (2k-d/2).  Let  IIAk<f)]l2  =  x,  1I0II2  =  z.  Note  that  ll<t>H2k,2  ^ 
C(x+z);  one  has  then 


J  (0)  <  -((c1x2  -  c2z2)vc3z2)  +  c4(x+z) 


X+Z  — $°° 

_ ^  — co  t 


and  the  proof  of  a)  is  completed. 

We  finally  show  b).  Note  first  that  since  2k>d/2,  weak  convergence  in 
W^CD)  implies  strong  convergence  in  L2(D)  and  therefore  the  second  integral  in 
the  R.H.S.  of  (4.3)  is  weakly  continuous.  Considering  the  first  integral,  note  that 
it  is  defined  by  the  pairing  <W'2k’2,  W2k’2>  and  since  dy  e  W‘4/2“5/2(D) 
W-2k,2(D),  the  weak  continuity  follows  immediately.  Finally,  considering  the  third 
0) 

integral,  let  <[>n  -»<|)  in  W2k>2(D),  then  clearly  B[<j>n,  ej]  — >  B  [4>,ei]  where  ei  is  any 
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member  of  the  orthonormal  basis  associated  with  P  and  B  in  the  associated  Dirichlet 
form.  But 


lim  inf  IIP(|)n||2  =  lim  inf  X^e/ 

n— n->=  i  =  1 


=  lim  inf  (B[<t>n,  e.])2 

n— »<*>  i=l 

>2,  lim  inf(B[<j)n, e.  ])2 

i=l  n— >oo 

=  £  (BW.ej])2  =  IIP<WI2  (4 

where  we  used  Fatous'  inequality.  This  completes  the  proof  of  the  theorem. 

We  conclude  this  section  by  the  following  representation  result  for  the 

estimator  <j): 

Theorem  4.3: 

Let  k  be  an  integer,  k>d/2.  Any  maximizer  of  (4. 1)  is  a  weak  solution  of 


P*P$  =  -h’($)h($)  +  h'($)oy 

onD 

p9$=o 

on  3D 

(4.4) 

P  JP$  =  0 

.  d 

on  3D 

where  P^denotes  the  boundary  operator  defined  by  P*  i.e.,  for  <|>i,  <j>2  e  C°°(D) 
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2k- 1 

(!**♦,,  v ^ = S  cp ;  Vi  Tt 

j=i  dn 


with  nbeing  the  exterior  normal  to  D,  P^  above  defined  by  Green's  formula  and 
(P^l)  =  [(Pg<t>l)k,  (P^^Ok+ri-.CP^^Oak-l]- 

The  proof  of  theorem  4.3  follows  by  an  easy  application  of  the  necessary 
conditions  of  the  calculus  of  variations  and  is  therefore  ommitted.  Note  that  by 
lemma  3.1  we  can  understand  (4.4)  as  a  pathwise  equation  defined  for  each 

ve  CfDi!  Note  also  that  by  (4.4),  $e  L. 

Remark:  Note  that  by  [5],  the  Ogawa  integral  in  (4.4)  may  be  replaced  by  a 
Stratonovich  integral,  by  using  a  Haar  basis.  This  basis  also  yields  an 
approximation  to  (4.4)  by  means  of  difference  equations. 

Appendix 


Clearly,  all  we  have  to  show  is  that 


E_  (exp  f  (h«|>(Q))  -  h(u(fi)))  oy(0))lll<|)-ull  <  e,y) 


£ — >0 
->  1 


(A.1) 


Let  u8(0)  =  u(0)*j8(0),  <J)8(0)  =  <t>(0)*j5(0)  where  j8(0)  is  a  8(e)  molifier 
(say,  a  Poisson  kernel).  We  will  choose  8(e)  ^  -^0  below.  Note  that  (A-l)  will  be 

proved  if  we  can  show  that 


(I1  =  exp(cj"(h(<t>)  -  h(<j>5))oy) 


8(e)->0 

1 


(A.2a) 


p2  =  E  _  (exp(c  f(h(<J)5)  -  h(u5))oy)lll<|>-ull  <  e,y) 

w  * 


S(e)->0 

0  1,  VceR 


(A.2b) 


p3  =  E_  (exp(c  f(h(u8)  -  h(u))oy)lll<j)  -  ull  <  e,y) 

w  J 


S(e) — >0 

0  _>  1,  VceR 


(A.2c) 
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Consider  first  (A-2a):  Note  that  since  (J)e  W2k’2(D), 


,8->0 


llh((j))  -  h((j>S)ll2k,2  ^  c8  — >0  which  implies  (A-2a).  Consider  next  (A-2b):  Note 

g 

that  llh(<J>5)  -  h(u8)lbv  ?  <  c  ll<|>5  -  u8ll2k,2  ^  c  -xr-.  By  choosing  8‘2k  =  o(e_1), 

8 


one  obtains  (A-2b). 

We  finally  consider  (A.2c):  Since  under  EL  ,  y  is  white  noise  and  u  is 

w 

independent  of  y,  one  has: 


Vc,  Icl  >  1  E 


,((^3)c)<E_ 


exp  c'llh(u)  -  h(u5)ll~ 


6(e)->0 

<  exp(c  ’82)  — »  1 


which  concludes  the  proof  of  the  lemma. 
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